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The Wilkinson Microwave Anisotropy Probe (WMAP) mapped the distribution of 
temperature and polarization over the entire sky in five microwave frequency bands. 
These full-sky maps were used to obtain measurements of temperature and polariza- 
tion anisotropy of the cosmic microwave background with the unprecedented accuracy 
and precision. The analysis of two-point correlation functions of temperature and polar- 
ization data gives determinations of the fundamental cosmological parameters such as 
the age and composition of the universe, as well as the key parameters describing the 
physics of inflation, which is further constrained by three-point correlation functions. 
WMAP observations alone reduced the flat A cold dark matter (ACDM) cosmologi- 
cal model (six) parameter volume by a factor of > 68,000 compared with pr e-WMAP 
measurements. The WMAP observations (sometimes in combination with other astro- 
physical probes) convincingly show the existence of non-baryonic dark matter, the cosmic 
neutrino background, flatness of spatial geometry of the universe, a deviation from a 
scale-invariant spectrum of initial scalar fluctuations, and that the current universe is 
undergoing an accelerated expansion. The WMAP observations provide the strongest 
ever support for inflation; namely, the structures we see in the universe originate from 
quantum fluctuations generated during inflation. 

Subject Index xxxx, xxx 


1. Introduction 

The WMAP [1] spacecraft was designed to measure the full-sky distribution of tempera- 
ture differences (anisotropy) and polarization of the cosmic microwave background (CMB). 
WMAP is the successor of the legendary Cosmic Background Explorer ( COBE ) satellite, 
whose spectrograph provided a precision-measurement of the CMB blackbody, implying 
that matter and radiation were in thermal equilibrium, consistent with the expectation of 
the hot Big Bang theory of the universe [2], The COBE differential radiometers discovered 
the primordial ripples in spacetime that existed in the early universe [3]. With 35 times 
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better angular resolution and 40 times better sensitivity than COBE , WMAP took the 
cosmological research with CMB to the next level. 

WMAP was proposed to NASA as a MIDEX (Medium-Class Explorers) mission in 1995. 
Four of eight Co-Investigators 1 of WMAP were previously on the COBE Science Team. After 
being selected in 1996, WMAP launched on June 30, 2001, and arrived in its orbit around 
the second Lagrange point (L2), 1.5 million kilometers from Earth, three months later. Since 
then, WMAP operated almost flawlessly for nine years until it left its L2 orbit on September 
8, 2010, to pass the baton to its successor, the Planck satellite, which arrived at L2 in July 
2009. 

The WMAP team issued five data releases. The first-year data release (February 11, 2003) 
came with 13 papers [4-16] and later with one more paper [17]; the third-year data release 
(March 16, 2006) came with 4 papers [18-21] and later with one more paper [22]; the five-year 
data release (March 5, 2008) came with 7 papers [23-29] and later with one more paper [30]; 
the seven-year data release (January 25, 2010) came with 6 papers [31-36]; and the final, 
nine-year data release (December 21, 2012) came with 2 papers [37, 38]. In addition, detailed 
descriptions of the mission, data processing, calibration, as well as of the data products for 
each release are given in the Explanatory Supplement document [39]. 

In this article, we give a brief review on the WMAP experiment, the data analysis, and 
the main science results from the nine- year observations. 

2. How WMAP measures temperature and polarization 

2. 1 . Temperature 

WMAP measured the distribution of temperature and polarization of the entire sky in five 
frequency bands (23, 33, 41, 61, and 94 GHz). Five frequencies were necessary to separate the 
CMB from emission of our own Galaxy, including synchrotron, free-free, and dust emission 
(see section 3 for how to reduce the effect of the Galactic emission). 

WMAP has two back-to-back mirrors to focus the incoming electromagnetic waves arriving 
from two different lines of sight, which are separated by 141° from each other. The size of 
each mirror is 1.4 mxl.6 m, providing substantially better angular resolution than COBE, 
which did not have mirrors but only horn antennas. The incoming waves collected by two 
mirrors are received by a pair of feed horns. Let us call these two inputs “A side” and “B 
side.” 

Each of the inputs from the A and B sides is separated into two orthogonal polarized waves 
by Orthomode Transducers (OMTs), and then sent to a pair of radiometers. WMAP has 
20 radiometers, or 10 pairs of radiometers. We call each pair of radiometers a “differencing 
assembly” (DA). There are 1, 1, 2, 2, and 4 DAs at 23, 33, 41, 61, and 94 GHz, respectively. 
Each DA measures temperature and polarization differences between the A and B sides. 

Each of 20 radiometers processes inputs from the A and B sides as follows (section 2.3 of 
[40]) 

1. The inputs are separated into two linear combinations, -^(A + B) and -^(A — B). 


1 The Co-Investigators on the WMAP proposal are C. L. Bennett, G. Hinshaw, N. Jarosik, S. S. 
Meyer, L. Page, D. N. Spergel, D. T. Wilkinson (deceased), and E. L. Wright. Among them, Bennett, 
Hinshaw, Wilkinson, and Wright were previously on the COBE Science Team. 
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Fig. 1 Definition of the Stokes parameters with respect to Galactic coordinates (adapted 
from [35]). “N” and “E” denote the Galactic north and east directions, respectively. 


2. These linear combinations are amplified by High Electron Mobility Transistor (HEMT) 
amplifiers, yielding 
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where ni and n 2 are the noise added by the amplifiers, and g\ and g 2 are the gains of 
the amplifiers. 

3. The latter combination, u 2 , is phased-switched to yield ±u 2 . Then u\ and the phase- 
switched u 2 are combined to form vi = u\ ± u 2 and v r = u\ =F u 2 . 

4. The combined signals are detected by the square-law detectors (diodes). The outputs, 
Vi and V r , are proportional to vf and respectively. 

5. Finally, we compute the difference between V) and V r , obtaining 


Vl-Vr 
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S -(A 2 - B 2 )g l92 , 


(2) 


where s is the proportionality constant of the square-law detector. This quantity is thus 
proportional to the difference between the powers of light coming from the A and B 
sides; i.e. , WMAP measures temperature differences in the sky separated by 141°, and 
the mean CMB temperature (2.725 K) and many types of undesirable systematic effects 
cancel out. 


We need to convert the measured ^(Vi — V r ) (in units of voltages) to a temperature dif- 
ference in thermodynamic units. We do this by using the dipole anisotropy of the CMB. As 
Earth orbits around Sun, the L2 point (hence WMAP ) also orbits around Sun at 30 km/s. 
This motion creates time-varying dipole anisotropy, creating a sinusoidal signal changing 
over a year. As we know the mean CMB temperature and the orbital velocity precisely, we 
know that the amplitude of this signal must be T cm ^v/c = 273 /iK. This fixes the propor- 
tionality constant between i(V) — V r ) and Ta — Tg, where Ta and Tg are the temperatures 
toward the A and B sides, respectively. 


2.2. Polarization 

The polarization information is still entangled in Ta — Tb measured by each radiometer. 
To measure polarization, we need to combine the measurements of a pair of radiometers 
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forming each DA. Let us define d\ = (Ta — Tb ) i and ^2 = {Ta — Tb) 2 , where d\ and cfo are 

the outputs of two radiometers. The action of OMTs gives (equations 10 and 11 of [18]) 

di = Ia + Qa cos 274 + Ua sin 27,4 - I B ~ Qb cos 2^ B ~U B sin 2^y B , (3) 

d 2 = I A - Qa cos 27 A - U a sin 27,4 - Ib + Qb cos 2^ b + U b sin 2j b ■ (4) 

Here, I a, Qa, and Ua are the Stokes parameters describing the incoming waves from the A 
side. We define the Stokes Q and U such that the polarization directions of a pure Q signal 
are parallel to either Galactic longitudes or latitudes, and the polarization directions of a 
pure U signal are 45 degrees tilted from those of a pure Q signal. In other words, a pure Q 
signal aligns with either the Galactic north-south or east-west direction (see figure 1). Then, 
721 is the angle between a meridian through the Galactic poles and the projection of the 
electric held of each output port of the OMTs on the sky. 

The sum and difference of the instantaneous outputs of two radiometers thus yield 

= I A ~ Ib, (5) 

= Qa cos 274 + U a sin 274 - Qb cos 2-{b - Ub sin 2^ B - (6) 

The sum gives the temperature difference (i.e., difference of unpolarized intensities), while 
the difference gives a combination of the Stokes Q and U. 

The polarization angles of radiometers were measured on the ground using a polarized 
source. The uncertainty of the measurement is 1° (section 2.5 of [41]), and the measurements 
are within ±1.5° of the design orientation (section 3 of [20]). In flight, we observe Tau A 
[20, 36], and find that the standard deviation of angles measured in five bands is 0.6° (see 
Table 15 of [36]). This is consistent with (and is smaller than) the scatter of the ground 
measurements and we conservatively use 1.5° as an estimate of the systematic error in the 
polarization angle of WMAP. 

2. 3. Map making 

The instantaneous outputs of two radiometers per DA yield the temperature and polarization 
differences between the A and B sides. The next step is to reconstruct the distribution of 
temperature (minus the mean CMB temperature) and polarization over the entire sky. 

WMAP scans the full sky in six months. We can then estimate maps of temperature and 
polarization over the full sky using the six-month data. We first write down the measured 
time-ordered data (TOD) as (section 3.4 of [18]) 

dt — ^ ] M^pTYip T nt, (7) 

p 

where dt is the TOD of two radiometers, d = (di, c^), measured at a given observation time, 
t; m p is the actual sky map consisting of m = (I, Q , U) at a given sky location (pixel), p; nt 
is noise of the TOD; and Mt p is the so-called “mapping matrix” which projects m p onto dt- 
For an ideal differential experiment, the mapping matrix is a 2 Nt x 3 N p matrix, where 
Nt is the number of data-recording times and N p is the number of sky pixels. Each row 
corresponds to one observation, while each column corresponds to a map pixel. Each row of 
Mtp has 6 non-zero elements. The non-zero elements are ±1 in the columns corresponding 
to the observed pixels in the I map; and ± cos 27,4, ± sin 274, ± cos 2^b, and ± sin 2^b for 


d\ + e?2 
2 

d\ — ^2 


4/26 


the Q and U maps. The plus and minus signs are chosen according to equations (3) and (4). 
Each observation is associated with 12 non-zero values of Mt p that are distributed in two 
rows. 

In reality, there are two dominant nonidealities in radiometers that must be taken into 
account. One is the “bandpass mismatch.” We take the difference between two radiometers 
to measure polarization. While these radiometers were designed and built to have nearly 
identical frequency responses (bandpass) to the incoming electromagnetic waves, a slight 
mismatch in the bandpass produces a spurious polarization signal even in the absence of 
polarization. Suppose that the incoming waves are unpolarized and have a spectrum of 7(i/). 
Due to the bandpass mismatch of two radiometers, they receive the incoming waves at slightly 
different effective frequencies, iq and iq. As a result, the difference between two radiometers 
does not vanish, producing a spurious polarization, s, given by s = /(zq) — I(v 2 ) ~ (zq — 
1*2)81/ dv. While the CMB, whose temperature does not depend on frequencies, does not 
produce a spurious polarization, the other components (such as Galactic emission) that 
depend on frequencies do produce a spurious polarization. 

Fortunately, it is relatively straightforward to remove this effect. Equations (3) and (4) 
show that the real polarization signals are modulated by the angle 7 . On the other hand, a 
spurious polarization is independent of 7 . Therefore, we can separate the real and spurious 
polarization signals if we have enough coverage in 7 . We modify equations (3) and (4) as 
(equations 17 and 18 of [18]) 

d\ = Ia + Qa cos 27 a + Ua sin 2 qq + sa~ Ib ~ Qb cos 2 ~/ b ~ U B sin 27 ^ - s B , ( 8 ) 
d 2 = 1a - Qa cos 27 A - Ua sin 2 qq - sa~ Ib + Qb cos 275 + U B sin 2 q B + s B , (9) 

and expand the mapping matrix to a 2 A/ x 4 N p matrix. (Each row has 8 non-zero elements.) 
While WMAP’s scan pattern allows for a uniform coverage in 7 near the ecliptic poles, 
it covers only 30% of possible 7 on the ecliptic plane. This produces noisy modes in the 
reconstructed sky maps, which must be properly de-weighted. (By comparison, Planck’s 
coverage is < 4% on the ecliptic plane.) 

The second non-ideality is the “transmission imbalance,” which is the difference between 
the A and B sides; namely, the A and B sides do not necessarily have equal responses to 
the incoming waves due to loss (i.e. , imperfect transmission) in the system. The spurious 
polarization is an additive effect, but the transmission imbalance is a multiplicative effect, 
given by (equations 19 and 20 of [18]) 

d\ = (1 + Xi m )[I A + Qa cos 27 a + U A sin 2 qq + s a] 

-(1 - x irn )[I B + Q B cos 27 b + U B sin 27 # + s B ], ( 10 ) 

d 2 = (1 + Xim)[lA ~ Qa COS 27 A - U A sin 2qq - sq] 

-(1 - Xim)[I B - Q B cos 27 s - U B sin 27 # - s B ], (11) 

where Xi m is the transmission imbalance factor, which has been measured using the responses 
of radiometers to the CMB dipole (table 2 of [18]). We include the transmission imbalance 
in the mapping matrix by multiplying the A- and B-side elements by 1 + Xi m and 1 — x i m , 
respectively. 
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The optimal estimator for a sky map, m pi that is unbiased and has the minimum variance, 
is given by (in matrix notation) 


fh = (M T N- 1 M)- 1 M T N~ 1 d, 


( 12 ) 


where N (= JV#/) is the noise matrix of the TOD. The TOD noise of WMAP is stationary, 
in a sense that the noise matrix is a function of At = 1 1 — t'\. The noise matrix is given by 


AT 1 (At) 


iVf 1 0 \ 

0 N^ 1 ) ’ 


(13) 


with (equation 21 of [18]) 


Nr 1 (At) 


C {/ e iuAt [/ e iajt ' Ni(t')dt']- l du ; + K } , 

0 , 


A t <c Af max , 
At > A t max , 


(14) 


where At is in units of samples, and A f max is the time lag at which the TOD noise correlation 
function (iVj(t) in the integral) crosses zero, typically ~ 600 seconds. The coefficients, C and 
K , are chosen such that N^ 1 ( 0) is normalized to unity, and that the mean of AT 1 (At) over 
0 < At < A t max vanishes. The TOD noise correlation function, Ni(t), is measured from the 
data and a functional form is given in equation (4) of [18]. Given this noise matrix of the 
TOD, we solve the equation, {M T N~ l M)rh = M T N~ 1 d , to find a sky map solution, fh, 
using the conjugate gradient method. 

As described in section 2.1, we use the CMB dipole to convert the input signals (in 
voltages) to thermodynamic temperatures. The uncertainty in this conversion (calibration 
uncertainty) per DA is 0.2% for the final nine-year temperature and polarization maps, which 
has unchanged since the five-year analysis (section 4 of [23]). Figure 2 shows the nine-year 
solutions to the full-sky / maps (minus dipole anisotropy), while figures 3 and 4 show the 
Stokes Q and U maps, respectively, in five frequency bands. 


3. Galactic and extra-galactic foreground emission 

3. 1 . Temperature 

Figure 2 shows that the distribution of the measured temperatures at high frequencies (41, 
61, and 94 GHz) at high Galactic latitudes are quite similar. This means that, at these 
frequencies, the temperature data at high Galactic latitudes are dominated by the CMB 
which is independent of frequencies. On the other hand, the data at lower frequencies (23 
and 33 GHz) are clearly affected by strong Galactic emission, and the data near the Galactic 
plane are dominated by the Galactic emission at all five frequencies. Also, there are many 
extra-galactic sources (most of which are synchrotron sources) all over the sky, which need 
to be removed from the cosmological analysis. 

To reduce the effects of the so-called “foreground” emission from our Galaxy and extra- 
galactic sources, we combine two methods: one is to simply mask the pixels which are strongly 
affected by the foreground emission; and the other is to estimate and remove the foreground 
emission from the sky maps. 

Three Galactic foreground components are known to dominate in the WMAP frequencies 
(section 3 of [5]): synchrotron, free- free, and dust. The antenna temperatures of these three 
components go (very approximately) as oc i/^ 3 , z/~ 2 , and v 2 , respectively. As a result, syn- 
chrotron dominates at the lowest frequencies (23 and 33 GHz), free- free dominates in some 
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Fig. 2 Full-sky nine-year temperature maps in five frequency bands measured by WMAP 
(adapted from [37]), shown in Galactic coordinates. Dipole anisotropy has been removed 
from the maps. Maps are smoothed by a 0.2 degree Gaussian to suppress noise. 


regions in the sky at the intermediate frequencies (41 and 61 GHz), and dust dominates at 
the highest frequency (94 GHz). 

We define the mask for the temperature data as follows (section 2 of [25]). We first smooth 
the 23 GHz (K band) map to one degree resolution, and remove an estimate of the CMB 
from this map. The CMB is estimated by the internal linear combination (ILC) method 
(section 5.2 of [19]). We then mask the pixels brighter than a certain threshold temperature, 
until 75% or 85% of the sky is left unmasked. We repeat the same procedure for the 41 GHz 
(Q band) map. The masks defined by the K- and Q-band maps are added to form two masks, 
“KQ75” and “KQ85” masks, depending on how much sky was left unmasked in the K- or 
Q-band map. 

The mask for extra-galactic sources is created using the locations of the known bright 
radio sources in the literature (as described in section 7 of [5]), the sources found in the 
WMAP nine- year data (section 5.2.2 of [37]), and additional sources found in the Planck 
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Fig. 3 Full-sky nine-year Stokes Q maps in five frequency bands measured by WMAP 
(adapted from [37]). Maps are smoothed to a common Gaussian beam of 2 degrees to suppress 
noise. See figure 1 for the correspondence between the signs of Q and polarization directions. 


early release compact source catalog at 100 GHz [42]. An exclusion radius of 1.2 degree is 
used for sources brighter than 5 Jy in any WMAP band, and an exclusion radius of 0.6 
degree is used for fainter sources. 

To further reduce foreground emission in unmasked pixels, we estimate the distribution of 
the diffuse Galactic foreground emission over the full sky and remove it from the WMAP 
maps. We use the difference between the 23 and 33 GHz maps (which does not contain the 
CMB) for synchrotron emission; a map of Ha [43] corrected for extinction and scattering 
(section 5.3.1 of [37]) for free-free emission; and a map of dust emission [44], These three 
maps are simultaneously fit to and removed from the 41, 61, and 94 GHz maps, yielding the 
foreground- reduced maps at these frequencies. 

Finally, we slightly enlarge the KQ75 and KQ85 masks by further masking the regions 
which have significant excess in the differences between the foreground-reduced 41 and 
61 GHz maps, and 61 and 94 GHz maps. The resulting nine-year temperature masks, 
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Fig. 4 Same as figure 4 but for Stokes U. 

“KQ75y9” and “KQ85y9,” retain 68.8% and 74.8% of the sky, respectively. The former 
is used for testing Gaussianity of the temperature data, while the latter is used for the 
power spectrum analysis. 

3.2. Polarization 

Figures 3 and 4 show that the polarization maps at low frequencies are dominated by the 
Galactic emission. We thus mask the regions strongly contaminated by foreground emission, 
and remove an estimate of the foreground emission from unmasked pixels. 

Two polarized foreground components are known to dominate in the WMAP frequencies 
[20]: synchrotron and dust. Therefore, one might think that reducing foreground in polar- 
ization is easier than in temperature, as we have fewer foreground components. In reality, 
the polarization analysis is more challenging because the CMB signal in polarization is 10 
times fainter than in temperature. While the polarized foreground emission is clearly seen 
in figures 3 and 4, there is no clear evidence for the CMB signals; thus, we need more 
sophisticated statistical analysis to extract faint polarization signals of the CMB. 
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We define the mask of polarization maps using the 23 GHz map (which is dominated by 
synchrotron) and a model of dust emission [20]. We first create lower resolution Q and U 
maps at the 23 GHz, each containing 3072 pixels. We then compute the total polarization 
intensity, \J Q 2 + U 2 . As the presence of noise produces a small positive bias in this quantity, 
we remove it using our estimate of noise in the maps. Finally, we mask the pixels that are 
brighter than a certain threshold polarization intensity. We choose the threshold to be 0.6 
times the mean polarization intensity, and call this mask the “P06” mask. As for dust, we 
use a map of dust emission constructed from the WMAP temperature data using Maximum 
Entropy Method (MEM) (section 5 of [5]). We choose a threshold in this map to be 0.5 
times the maximum value found in the polar caps (| 6 | >60 degrees), and mask the pixels 
brighter than this threshold value. We then add this dust mask to the P06 mask. We find 
that extra-galactic sources are minimally polarized in the WMAP frequencies; thus, we only 
mask ten bright sources outside of the P06 mask [20, 37]. The combined P06 mask including 
synchrotron, dust, and extra-galactic sources retains 73.2% of the sky for the cosmological 
analysis. 

To further reduce foreground emission in unmasked pixels, we use the 23 GHz polarization 
maps to trace synchrotron. As for dust, we use (equation 15 of [ 20 ]) 

Qdust(fi) — Gust (h)l/<iust (h) cos [2>v]ust (d)] , (15) 

Hdust(fl) — G us t (n) (/just; (if) sin[2y c i us t (n)] , (lb) 

where G ust is the same dust map that we used for the temperature analysis [44], Dust 
emission is polarized because dust grains are not spherical, and are aligned with coherent 
magnetic fields in our Galaxy such that the semi-major axes of grains are perpendicular 
to the fields. As a result, the polarization directions of dust emission are perpendicular 
to the field directions. The dust polarization we observe along a particular line of sight is 
the projection of multiple polarization signals along the line of sight, which is affected by 
geometry of the fields. We take this projection effect into account by the function gdust) 
which is calculated using a simple model of fields given in section 4.1 of [20]. 

How do we estimate 7 dust? As the polarization directions of synchrotron are also per- 
pendicular to the field directions, we could take the polarization directions at 23 GHz as an 
estimate for 7 dusP however, alignment of dust grains with the fields is not necessarily perfect, 
which yields some differences between 7 dust and 7 syn ch- Therefore, we use the polarization 
directions measured toward stars. While the intrinsic starlight is usually unpolarized (or 
polarized very weakly), the observed starlight can be polarized due to selective extinction 
of the starlight when it passes through regions with dust grains. As it gets more extinc- 
tion along the semi-major axes of dust grains, the observed starlight polarization direction 
is precisely orthogonal to the polarization direction of dust emission from the same loca- 
tion. We have compiled the existing measurements of starlight polarization in the literature 
(section 4.1.2 of [20]), and created a map of the starlight polarization directions, 7 *(n). 
We then compute the dust polarization direction as 7 dust(fl) = 7 *(n) + tt/2. The correlation 
between 7 a U st computed in this way and 7 syn ch shows that these two angles typically agree 
to 20 degrees. 

We simultaneously fit the 23 GHz polarization maps and dust polarization maps to reduce 
polarized foregrounds from the higher frequency maps at 33, 41, 61, and 94 GHz. Due to much 
lower signal-to-noise ratios in the polarization maps, removal of the polarized foreground 
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presents a great challenge to the WMAP analysis. The error in the foreground removal has a 
non-negligible impact on the inferred value of the optical depth to the Thomson scattering, 
t. For example, an alternative foreground removal method presented in [30] shifts the value 
of r as much as the 1 -o statistical uncertainty. 

4. Power spectrum measurements 

The steps we have described so far give us foreground-reduced temperature maps at 41, 61, 
and 94 GHz, and foreground-reduced polarization maps at 33, 41, 61, and 94 GHz. The 
temperature maps come with the KQ75 and KQ85 masks depending on the purpose of the 
cosmological analysis, and the polarization maps come with the P06 mask. In this section, 
we describe how to measure the angular power spectra from these maps. 


4-1. Temperature 

Assuming that the distribution of CMB temperatures in the foreground-reduced maps out- 
side the mask is given by a Gaussian distribution, which is a good approximation as described 
in section 6, a complete description of the measured CMB temperatures is given by the 
following probability density distribution function (PDF): 


p(T) 


exp 


-Worths + N)r b ST^ 


-(6)1 


^/det[27r(S' + N)] 


(17) 


where ST^ = T) a — T cm b is the difference between the CMB temperature toward a direction 
(or a sky pixel) i and the mean CMB temperature, T cm b = 2.725 K, with an extra index a 
denoting a DA and an observed period (e.g., first year, second year, etc). The total covariance 
matrix consists of the signal matrix, Si a jb, and the noise matrix, Wajb . These matrices are 
Mpi x MDA-Myear x M P [ x Mda M ye&r matrices, where M p \ x is the number of pixels, Mda = 6 is 
the number of DAs used for the temperature analysis (2 at 61 GHz and 4 at 94 GHz), and 
-ZHyear = 9 is the number of years. For WMAP, the noise matrix vanishes unless a = b, i.e., 
noise is uncorrelated between different DAs or years. 

As WMAP is a differential experiment measuring temperature differences between two 
points separated by 141 degrees, there is a pixel-to-pixel noise correlation at 141 degrees 
(figure 11 of [10]). However, this correlation has a negligible influence on the temperature 
analysis, as the signal covariance due to the CMB totally dominates at large angular scales 
where this pixel correlation is important. We thus have a simple description of the noise 
matrix: 


Niajb 


_ L^O 


(“)l2 


n 


(a) 

obs,z 


vij $ ab 5 


(18) 


where Uq 1 ' 1 sets the overall noise level per DA per observation, and n^ s i gives the corre- 
sponding effective number of observations in a sky pixel i. We also set the noise matrix to 
have a large value in the masked pixels, i.e., we effectively set the noise level to be infinity 
at the masked pixels. 

The CMB signal matrix is given by 


S ia jb = ^ + Wibfbf P t (c08 9ij) t 

t 


(19) 
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where Ci is the temperature power spectrum of the CMB, and b[ a> is the so-called “beam 
transfer function,” which is the Legendre transform of a symmetrized beam profile of a given 
DA, and Pi(x) is the usual Legendre polynomials. An additional smearing due to pixelization 
may also be included in b \ a ' > . 

The beam transfer functions of all DAs are determined by the repeated observations of 
Jupiter and the full physical modeling of the optical system of WMAP [7, 24], Two Jupiter 
observing seasons of ~ 50 days each occur every 395-400 days, and we had 17 seasons of 
the Jupiter data over nine years of operation. The Jupiter data are used to determine the 
inner beam profiles directly out to radii at which the antenna gains drop to 2, 3, 5, 6, 
and 9 dBi (or —45.0, —46.4, —46.3, —48.8, and —48.8 dB) at 23, 33, 41, 61, and 94 GHz, 
respectively (section 3 of [37]). Then, the physical optics modeling is used from the inner 
beams out to 7.0, 5.5, 5.0, 4.0, and 3.5 degrees from the beam center at 23, 33, 41, 61, and 
94 GHz, respectively. The beam response at greater distances from the beam center (i.e., far 
sidelobes) has been measured on the ground before launch and in-flight using Moon [9]. 

The accurate determination of the beam transfer functions is crucial for the accurate 
recovery of the intrinsic CMB power spectrum, Ce, as the errors we make in b ^ propagate 
directly into C/. We estimate that the l-a uncertainty in the recovered C/ from the nine- 
year data due to the uncertainty in the beam transfer functions is 0.6% at ^ > 100. This 
uncertainty is coherent (correlated) over a wide range in t , and must be included in the 
cosmological analysis of the CMB power spectrum. 

How do we infer cosmological parameters from the CMB maps? We can calculate Cg as 
a function of cosmological parameters using the linear Boltzmann code such as CMBFAST 
[45], CAMB [46], and CLASS [47]. Ideally, we wish to evaluate equation (17) directly as a 
function of cosmological parameters given our knowledge of noise, mask, and beam transfer 
functions; namely, we interpret equation (17) as the likelihood function of temperature data 
given cosmological parameters, p(T\9), where 9 denotes a set of cosmological parameters. 
We then use Bayes’ theorem to obtain the posterior probability of the parameters given the 
temperature data as p(9\T) oc p(T\9)p{9). Here, p(9) is the prior probability of cosmological 
parameters, and we take it to be uniform within a certain reasonable range of 9. We then 
calculate the best-fit values of 9 and the 68% confidence intervals, etc. 

Another approach is to use equation (17) as the likelihood function of temperature data 
given power spectra, p(T\Cg), and obtain the posterior probability as p(Cg\T) oc p(T\Cg)p(Cg) 
with uniform p{Cg) within a certain reasonable range of C We then evaluate p{Cg\T) 
for theoretically computed Cg with various values of 9, and find the best-fit values of 9 
and the 68% confidence intervals, etc. In other words, we write the posterior probability 
of 9 as p(9\T) oc p(T\9)p(9) = f dC £ p(T\C t )p(C e \0)p(0), where p{C t \B) = 8[C e - Cf eory (0)] 
because we know how to calculate Cg theoretically as a function of 9. 

The difference between these two approaches is that the latter approach produces a con- 
venient intermediate product, p(Cg\T), which can be made much faster to evaluate than the 
full likelihood function using the so-called “Blackwell- Rao (BR) estimator” [48]. Note that 
the form of p(Cg\T) is non-Gaussian even though p(T\Cg) is a Gaussian (equation 17), as Cg 
is a quadratic function of temperatures. The central limit theorem makes p{Cg\T) closer to 
a Gaussian distribution for large values of £, but it is important to use the full non-Gaussian 
form of p(Cg\T) at small values of i for the cosmological parameter estimation. 
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While it is certainly possible to obtain p(Cg\T) using the BR estimator out to large values 
of l [49, 50], the computational cost is still quite substantial. Therefore, the WMAP team 
has adopted a hybrid approach: we use the BR estimator to compute the full p(Cg\T) at 
l < 32 from the ILC map with the KQ85 mask (following section 6 of [37]), and use the 
so-called “quadratic maximum likelihood (QML) estimator” [51, 52] for t > 32. The QML 
estimator, Cg. is obtained by Taylor-expanding the logarithm of equation (17) up to second 
order in Cg — Cg, and maximizing it with respect to Cg, i.e. , dlnp(T\Cg) /dCg\g = 0. The 
solution is 


c t = 

£' 


1 

2R + 1 


Wh (a) d (b) * 

ab m' 


(20) 


where is the spherical harmonics coefficient of a map filtered by (S + N) x , af^ = 
f d 2 h Yl m (n)[(S + N)" 1 5T]( a \h) . The matrix Fgg’ is given by 


Fgg’ 


d 2 In p \ 

dC t dC e , / 


(2f+ 2(4lp + 1) (s + + N)-£ w b^p e (cos o ly )b?\ 


( 21 ) 


where the repeated indices are summed. 

The QML estimator gives our best estimate for Cg at each l if we use the correct Cg in 
the S matrix. We can therefore improve the performance of the estimator by iterating the 
estimation: assume some reasonable Cg, estimate Cg, use the estimated Cg to recompute the 
QML estimator, and repeat. If an incorrect Cg is used, the QML estimator does not give the 
minimum variance, but it is still unbiased. 

While the QML estimator gives an estimate, we still need to calculate the form of the 
posterior distribution of Cg. We know that a Gaussian approximation is not accurate enough; 
thus, we combine a Gaussian distribution and a log-normal distribution with appropriate 
weights to obtain an improved form of the posterior distribution (following section 2.1 of 
[15]). We use the nine-year temperature data at 61 and 94 GHz, which have the highest 
angular resolutions, to compute Cg. 

Figure 5 shows the nine-year measurements of Cg along with estimates of the 68% CL 
error bars. The error bars are calculated as follows. Given the form of p(Cg\T), we calculate 
the second-order moment (variance) of Cg, and parametrize it as 


(SCt) = / dCg Cjp(Cg\T) - 


dCg Cgp{Cg\T) 


_ 2 (Cg + Ng) 2 

= W + l)/sV 


(22) 


where Ng shows the contribution from instrumental noise and a parameter / s k y ,^ may be 
regarded as the effective fraction of sky used for the analysis at each l. We know how to 
calculate Ng from the known properties of noise and beam transfer functions; thus, for a given 
value of Cg , the only unknown quantity is / s ky,t- Equation (22) thus provides definition of 
fsky,e, which is a slowly-varying function of l (section 2.2.1 of [15]). This equation shows that 
there is an irreducible uncertainty even in the absence of noise, 2C 2 /\(2t + 1 )f 2 ky g]. This is 
the so-called “cosmic variance” term, which arises from the fact that Cg is variance of CMB 
temperatures, and only 2£ + 1 samples are available for estimating variance at each Z. 
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Fig. 5 Nine-year angular power spectrum of the CMB temperature (adapted from [37]). 
While we measure Cg at each l in 2 < l < 1200, the points with error bars show the binned 
values of Ct for clarity. The error bars show the standard deviation of C/ from instrumen- 
tal noise, [2{2CpN{ + Nf)/{ 2i + l)/ 2 ky J 1 / 2 . The shaded area shows the standard deviation 
from the cosmic variance term, [2Cf/(2£ + l)/ sky ^] 1//2 (except at very low t where the 
68% CL from the full non-Gaussian posterior probability is shown). The solid line shows 
the theoretical curve of the best-fit ACDM cosmological model. 


4-2. Polarization 

The polarization analysis is similar to the temperature analysis. We begin with a Gaussian 
PDF for temperature and polarization: 


p(m) 


exp 


“3 Ejj E„l™i a, (S + N)r b m 


•?> 


y det[27r(5 + N)] 


(23) 


where m = (5T, Q, U), and the signal matrix contains all the power spectrum combinations 
such as Cj T , Cj E , Cf E , and Cf B (as well as parity-violating combinations, Cf B and Cf B , 
if necessary). The explicit expressions are given in appendix of [53]. 

The TE power spectrum does not add much to the parameter constraints but is included 
in the model fits. The most important information we obtain from the polarization likelihood 
is the optical depth, r, from the EE power spectrum at l < 10. We can evaluate the exact 
likelihood function given by equation (23) for such low multipoles, using the steps described 
in appendix D of [20]. More precisely, we use equation (23) to calculate the likelihood using 
the data at i < 23. We use the polarization maps at 33, 41, 61, and 94 GHz, while we use 
the ILC map for the temperature. Figure 6 shows the likelihood of the EE power spectrum, 
+ l)Cf E /(2n), for t = 2 through 7. 
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Fig. 6 Likelihood functions of the nine-year EE power spectrum for £ = 2 through 7 
obtained from equation (23) (adapted from [37]). These data fix the optical depth, r. The 
diamonds show the theoretical values of the best-fit ACDM cosmological model. 


It is still useful to compute the power spectrum of TE at high multipoles. For this we 
use a simplified approach: we do not weight the temperature maps at 61 and 94 GHz, 
while we weight the polarization maps at 41, 61, and 94 GHz by crg/n 0 b s ,i- We then compute 
(2£ + 1) 1 Yh m a J m a fmi and deconvolve the effects of the mask and weight following appendix 
A of [13]. 

Figure 7 shows the nine-year measurements of Cj E along with estimates of the 68% CL 
error bars. The error bars are calculated as follows: 


{(8df E ) 2 ) 


0 Cf T + Nj T ){Cf E + N ee ) + (I Cj E ) 2 

{ 2 £ + 1)/ S ky^/ S ky^ 


(24) 


where Nj T and N EE are the noise bias spectra of the temperature and E-mode polarization, 
respectively, and / s ^ y ^ and are the effective sky fractions of the temperature and 

E-mode polarization data, respectively. 
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Fig. 7 Nine-year angular cross power spectrum of the CMB temperature and E-mode 
polarization (adapted from [37]). While we measure Cj E at each i in 2 < £ < 1000, the points 
with error bars show the binned values of Cj E for clarity. The error bars show the standard 
deviation of Cj E , which include both the instrumental noise and the cosmic variance. The 
solid line shows the theoretical curve of the best-fit ACDM cosmological model. 


As the temperature and E-mode polarization are correlated, we can create images of El- 
mode polarization around temperature spots by averaging the polarization data around hot 
and cold spots. Figure 8 shows average images of temperature and polarization data. We 
find that the polarization data around hot and cold spots exhibit radial and tangential 
polarization patterns, as predicted by simulations. What is the physics behind them? 

First of all, the necessary and sufficient conditions for generating non-zero polarization 
of the CMB are to have Thomson scattering and quadrupolar temperature anisotropy 
around an electron. Frequent Thomson scatterings between photons and electrons suppress 
quadrupolar temperature anisotropy around an electron, and thus we need to wait until the 
photon decoupling epoch (at which photons and electrons become less strongly coupled) to 
produce polarization. How is then quadrupolar anisotropy around an electron created? 

It turns out that polarization (for scalar modes) traces a velocity gradient field of the 
plasma around gravitational potentials. Suppose that a packet of the plasma is falling into 
the bottom of the potential well. Due to acceleration, a velocity gradient is generated: the 
front of the packet falls faster than the back of the packet. Therefore, an electron at the 
center of the packet observes redshifted photons from both the front and back of the packet, 
whereas there is no redshift or blueshift from the sides of the packet. This produces a 
quadrupolar radiation pattern (colder along the motion of the packet and hotter in the 
perpendicular directions), and the produced polarization is parallel to the motion of the 
packet. The polarization pattern around a spherically symmetric gravitational potential well 
is radial , and the magnitude of radial polarization is maximal at twice the sound horizon 
radius at the decoupling epoch (or 1.2 degrees in the sky) from the bottom of the potential 
well [35]. As the packet approaches the bottom of the potential well, the packet decelerates 
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Fig. 8 Average images of temperature and polarization data. 12387 hot spots and 12628 
cold spots are found in the WMAP seven-year temperature maps, and the average images 
of hot and cold spots are shown in the top panels along with the corresponding simu- 
lated images. The bottom panels show the average images of the polarization maps around 
the locations of hot and cold temperature spots, as well as the corresponding simulated 
images. The size of each image is 5° by 5°. The lines show the polarization directions, 
and their lengths are proportional to the magnitude of polarization. The colors of the 
polarization images are chosen such that blue and red show the tangential and radial 
polarization patterns, respectively. The data show the predicted tangential and radial polar- 
ization patterns (F-mode polarization), in excellent agreement with the predictions. The 
maximum of radial polarization around hot spots occurs at 1.2 degrees from the center, 
whereas the maximum of tangential polarization around hot spots occurs at 0.6 degrees from 
the center. Figure adapted from http://wmap.gsfc.nasa.gov/media/101079/index.html 
(Credit: NASA/WMAP Science Team). 


because of a pressure gradient. In the adiabatic initial condition, the photon density is high 
at the bottom of the potential well, producing a pressure gradient to decelerate motion 
of the plasma falling into the potential well. The front of the packet falls slower than the 
back of the packet. Therefore, an electron at the center of the packet observes blueshifted 
photons from both the front and back of the packet, whereas there is no redshift or blueshift 
from the sides of the packet. This produces the opposite quadrupolar radiation pattern 
(hotter along the motion of the packet and colder in the perpendicular directions), and the 
produced polarization is tangential to the motion of the packet. The magnitude of tangential 
polarization is maximal at the sound horizon radius (or 0.6 degrees in the sky) from the 
bottom of the potential well [35]. 

These predictions have been confirmed by the WMAP polarization data. The bottom 
panels of Figure 8 show the average polarization directions measured around hot and cold 
temperature spots. On these angular scales (a few degrees), hot and cold spots correspond 
to potential wells and hills, respectively. (The high photon energy density at the bottom 
of the well overcomes the Sachs-Wolfe effect, turning potential wells into hot spots in the 
sky.) Therefore, we expect each hot spot to come with the radial and tangential polarization 
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Table 1 Six cosmological parameters of the standard flat ACDM model, determined from 
the CMB data alone. We show the constraints from the WMAP nine-year temperature 
and polarization power spectra [37, 38]; the WMAP data combined with the ACT [55] and 
SPT [56] temperature spectra ( ^^ WMAP+ACT+SPT , )■, and the Planck 15.5-month tem- 
perature power spectrum combined with the WMAP nine-year polarization power spectrum 
( “Piandc+WP” ) [57]. 


Parameters 

WMAP only 

WMAP+ACT+SPT 

Planck+WP 

Q b h 2 

0.02264 ± 0.00050 

0.02229 ± 0.00037 

0.02205 ± 0.00028 

n c h 2 

0.1138 ±0.0045 

0.1126 ±0.0035 

0.1199 ±0.0027 

Da 

0.721 ±0.025 

0.728 ±0.019 

0-6851°;™ 

10 9 A| 

2.41 ±0.10 

2.167 ±0.056 

2 196 +0 ' 051 
z.iyu_ 0 060 

n s 

0.972 ±0.013 

0.9646 ± 0.0098 

0.9603 ± 0.0073 

T 

0.089 ±0.014 

0.084 ±0.013 

0 089 + °' 012 
u.uoa_ 0 014 


patterns at 1.2 and 0.6 degrees from the center, respectively, and each cold spot to come 
with the opposite patterns. As the magnitude of polarization is small, WMAP cannot detect 
polarization around each spot; however, by averaging polarization patterns around many 
spots, we can detect polarization. There are 12387 hot spots and 12628 cold spots outside the 
Galactic mask in the WMAP seven-year temperature map. Averaging the polarization data 
around these spots, the expected polarization patterns (shown in the “Simulation” columns 
in Figure 8) are clearly detected in the data (shown in the “WMAP Data” columns) [35]. 

The TE cross power spectrum and the average polarization images offer a powerful, preci- 
sion test of the standard cosmological model. We fix the basic six cosmological parameters 
by fitting the temperature power spectrum at 2 < £ < 1200 and the E-mode polarization 
power spectrum at low multipoles. We can then predict the cross power spectrum without 
any more additional free parameters. The prediction matches with the data at the precision 
shown in Figure 7 and 8. This is a great triumph of the standard cosmological model. 

The TE cross power spectrum offers also a powerful test of one of the generic predictions 
of cosmic inflation: the presence of “super-horizon” fluctuations, whose wavelength is greater 
than the horizon size at the decoupling time. This test is possible because polarization is 
generated only when there are free electrons. The reionization of the universe at z < 10 
can generate polarization only on very large angular scales, l < 10; thus, any TE signals at 
high multipoles must be generated at the decoupling epoch. The angle that subtends the 
radius of the horizon at the decoupling epoch is 1.2 degrees, which corresponds to i = 150. 
Therefore, the anti-correlation seen in the TE cross power spectrum at £ < 150 provides the 
direct evidence for the presence of super-horizon fluctuations at the decoupling epoch, i.e., 
the key prediction of inflation [16, 54], 

5. Cosmological Parameters from Power Spectra 

5.1. Standard six parameters 

The WMAP nine-year temperature and polarization data are consistent with the minimal 
six-parameter flat ACDM model [37, 38]. The high-1? temperature power spectrum (33 < £ < 
1200) gives x 2 = 1200 for 1168 degrees of freedom, with the probability to exceed (PTE) of 
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25.1%. The high-1' TE spectrum (24 < 1 < 800) gives y 2 = 815.4 for 777 degrees of freedom, 
with PTE of 16.5%. Therefore, the best-fit ACDM model is a good fit to the high-1 data. 

While the low-1 temperature likelihood (2 < 1 < 32) based on the BR estimator does not 
give a x 2 value, we find that the best-fit ACDM model is consistent with the distribution 
of spectra generated by the BR estimator. The low-1 polarization likelihood (2 < 1 < 23) 
evaluated directly in pixel space gives x 2 = 1321 for 1170 degrees of freedom with PTE of 
0.13%, which is unusually low. This excess y 2 can be interpreted as an additional noise 
component (due to, for example, residual foreground emission) of 0.27 pK per N s [d e = 8 
pixel (7.3° on a side), which is significantly lower than the average standard deviation of 

0. 86 ± 0.17 [j, K. We confirm that this excess noise does not affect the determination of t by 
using differences bewtween frequencies. See section 7.1 of [37] for details. 

We assume flat priors on the following six parameters: the amplitude of the primordial 
power spectrum at k = 0.002 Mpc -1 , A^, the tilt of the primordial power spectrum, n Sl the 
physical baryon density parameter, f , the physical CDM density parameter, Q c h 2 , the 
cosmological constant density parameter, Da, and the optical depth of the reionization, r. 

Table 1 summarizes the constraints of the cosmological parameters from the CMB data 
alone. Adding the smaller-scale CMB data from the Atacama Cosmology Telescope (ACT) 
[55] and the South Pole Telescope (SPT) improves the parameter constraints significantly. 
In particular, statistical significance of a deviation of n s from unity increases from 2.1cr 
to 3.6o\ With additional cosmological measurements (Baryon Acoustic Oscillation [58-61] 
and the local Hubble constant [62]) this improves to n s = 0.9608 ± 0.0080 (68% CL), a 4. 90- 
deviation from unity. This is a great achievement in cosmology, providing strong evidence 
for cosmic inflation. 

The parameters found from the Planck 15.5-month data combined with the WMAP low-1? 
polarization data ( “Planck+WP” ) are consistent with the WMAP and WMAP+ACT+SPT 
parameters to within the quoted error bars. With the Planck+ WP combination, 1 — n s is 
detected at 5.4o\ This is the first time that n s < 1 is detected with > 5o- from the CMB 
data alone. 

The WMAP measurements also provide definitive evidence for the existence of non- 
baryonic dark matter with Q c /Qb = 5.0 ± 0.2 (68% CL). This measurement comes from 
a combination of the ratio of the heights of odd and even acoustic peaks giving f l^h 2 and the 
ratio of the heights of the first and other peaks giving the total matter density contribut- 
ing to gravitational potential well. In other words, WMAP measures the density of matter 
which interacts with photons, and which does not. (No matter is left behind.) The difference 
between the two provides definitive evidence for non-baryonic dark matter. 

WMAP has erased lingering doubts about the existence of dark energy. This measurement 
comes from the peak positions giving the angular diameter distance, d,A = c dz/H(z ), 
where z* = 1091 is the redshift of the photon decoupling. The Friedmann equation relates 
the Hubble expansion rate, H (z), to the total energy density in the universe. As the integral is 
dominated by low redshift contributions, this provides an estimate of the total energy density 
in a local universe. As we have the complete account of matter density in the universe at any 
redshifts after z*, the difference between the total energy density inferred from cIa and the 
total matter density gives the energy density of some substance which is not even matter, 

1. e. , dark energy. Strictly speaking, this measurement is possible if we assume flatness of the 
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universe or combine the WMAP data with other cosmological data (such as the local Hubble 
constant measurements). Alternatively, we can use the effects of gravitational lensing on the 
CMB to detect dark energy from the CMB data alone [63]. 


5.2. Parameters beyond flat ACDM 

The minimal six-parameter model fits all the data we have at the moment (with a possible 
exception of the tensor-to-scalar ratio, r, which the BICEP /Keck Array collaboration claims 
to have found recently from the B-rnode polarization at degree angular scales [64]). As a 
result, the WMAP data (sometimes in combination with other CMB and non-CMB data) 
place stringent limits on the parameters beyond the minimal model. 

Spatial geometry of the universe is consistent with flat (Euclidean) space. By combining the 
WMAP+ACT+SPT with the CMB lensing data, we find = —0.001 ±0.012 (68% CL). 
When other non-CMB data (Baryon Acoustic Oscillation and the local Hubble constant 
measurements) are added, we find a stringent limit of Clk = — 0.0027^Q QQ 3 g (68% CL), i.e., 
0.4% measurement. 

Dark energy is consistent with a cosmological constant. The constraints on the equation of 
state parameter, w, are consistent with w = — 1 typically to within 10% (95% CL), depending 
on the data combinations. 

The cosmic neutrino background affects temperature anisotropy of the CMB in four ways: 
peak locations, early integrated Sachs- Wolfe effect, anisotropic stress, and enhanced damping 
tail (see section 4.3.1 of [38] for summary). Using these effects and the WMAP five-year 
data, we have made the first (indirect) detection of the cosmic neutrino background [29]. 
The CMB data give the total energy density of neutrinos, p u = (7ir 2 / 120) N e gT^, where 
N e ff is the effective number of neutrino species. Assuming the standard thermal history 
of the universe relating the asymptotic neutrino temperature to the CMB temperature as 
T„ = (4/11 ) 1 / 3 T cmb , we use the nine-year data combined with ACT and SPT to find N e g = 
3.89 ± 0.67 (68% CL), consistent with the standard value of 3.046 to within 2o\ 

The damping tail of the CMB is sensitive to the primordial helium abundance, ln e - The 
more helium we have, the more electrons are captured by helium nuclei before the decoupling, 
the fewer electrons are available at the decoupling, the more diffusion damping results. 
We have made the first detection of this effect by combining the WMAP seven-year data 
and the small-scale CMB data [35]. The nine-year data combined with ACT and SPT give 
Yne = 0.299 ± 0.027 (68% CL), consistent with the standard value of 0.25 to within 2a. These 
measurements of N e g and ln c offer a unique test of the Big Bang nucleosynthesis [65] . Our 
measurements are consistent with the standard Big Bang nucleosynthesis calculations. 

The WMAP nine- year data alone place a limit on the sum of neutrino masses, ^ m u < 
1.3 eV (95% CL). Adding the Baryon Acoustic Oscillation and the local Hubble constant 
measurements, the limit improves to < 0.44 eV (95% CL). 

Single-field inflation models predict that the fluctuations in matter and photons trace each 
other, obeying the adiabatic relation of bp m l p m = (3/4 )5p 1 / p^. We find that this relation 
holds to better than 7% (95% CL). This limit plays an important role in constraining the 
parameter space of axion dark matter models [29, 35]. 

The shape of the primordial power spectrum is sensitive to the physics of inflation. 
The “running spectral index,” dn s /dlnk, is typically predicted to be of order ( n s — l) 2 = 
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0(1O~ 3 ). The WMAP nine-year data alone give dn s /dlnk = —0.019 ± 0.025, while adding 
the small-scale CMB data improves the limit to dn s /d In k = —0.022 io'(m (68% CL), 
consistent with a power-law power spectrum to within 2a. 

Finally, inflation generates nearly scale-invariant tensor mode metric perturbations (gravi- 
tational waves) [66], h t j , which also contribute to the observed temperature and polarization 
anisotropies of the CMB. The amplitude of hij is parametrized by the “tensor-to-scalar 
ratio,” r, defined by r = 2(hijh 1 ^*) / (\1Z\ 2 ) . The WMAP data alone give r < 0.38 (95% CL), 
which improves to r < 0.17 by adding the small-scale CMB data. Adding the Baryon Acous- 
tic Oscillation measurements improves the limit further to r < 0.12 (95% CL). This limit 
largely comes from the low multipole temperature data (see section 3.2.3 of [29]), and thus it 
is sensitive to our assumption of a power-law power spectrum. Including the running index 
relaxes the limits to r < 0.43 (95% CL) with a large running index, dn s /d In k ~ —0.04, more 
or less independent of the data sets used. 

WMAP did not have sufficient sensitivity to detect B-mode polarization. Recently the 
BICEP/Keck Array collaboration claimed to have found B-mode polarization at degree 
angular scales at 150 GHz. If this signal is cosmological and originates from gravitational 
waves from inflation, it corresponds to r ~ 0.1 — 0.2 [64], As the measurement was done only 
at one frequency (the BICEP 1 data at 100 GHz are too noisy to be useful), confirmation 
of the signal at other frequencies must be made to reject foregrounds. In any case, an 
independent detection from an independent group is required before interpreting the detected 
signal as inflationary. 


6. Tests of Gaussianity with Angular Bispectrum 

Inflation predicts that primordial fluctuations originate from quantum fluctuations, and 
the distribution of primordial fluctuations is nearly a Gaussian distribution (see [67] for 
a review). Sustained inflationary expansion for at least 50 e- folds requires a field driving 
inflation to be weakly coupled. The wave function of quantum fluctuations of a scalar field 
with no interaction in the ground state is precisely a Gaussian; thus, a weakly coupled field is 
nearly a Gaussian field. The linear physics preserves Gaussianity, and thus CMB temperature 
and polarization anisotropies are predicted to obey Gaussian statistics with high precision. 
Confirmation of this prediction gives strong evidence for the quantum origin of primordial 
fluctuations. 

When the distribution is not a Gaussian, the PDF is no longer given by equation (17). How- 
ever, when a departure from Gaussianity, i.e., non-Gaussianity, is small, we may approximate 
the PDF by “Taylor-expanding” around a Gaussian distribution. Let us do this in harmonic 
space. We obtain an expanded PDF for the spherical harmonics coefficients as [68, 69] 


p{a) = 


1 ~\ E 

all iiVtij 

p 2 


i a £ 2 m 2 a £ 3 m 3 ) 


d 


d 


d 


dQ , £ 2 m 2 




/ Oup . 


X- 


(25) 


y / det(27r C) 

where Ce mj p m > = JA ■ Yi m ^S + N)ijYp, m , ■ is the signal plus noise covariance matrix in har- 
monic space. (We do not write indices for DAs or years for simplicity.) Here, the expansion 
is truncated at the three-point function (bispectrum) of ai m , and thus we have assumed that 
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the connected four-point and higher-order correlation functions are negligible compared to 
the power spectrum and bispectrum. (This condition is not always satisfied.) 

By evaluating the above derivatives, we obtain 2 


p(a) = 


^/det(27r C) 


exp 


EE ^£m (C- 1 ), 0*1' m’ 


im I'm' 


X 


1+ 6 S ( a £ 1 m 1 a £ 2 m 2 a £ 3 m 3 ) \{C CL)^ irril (C a ) £21^12 (C- a )hm 3 

all iimj 

— ‘MC )i 1 m 1 ,£ 2 m 2 {C a )^3ra 3 ] } • 


(26) 


This formula is useful, as it tells us how to estimate the angular bispectrum , 
(ai irni ae 2 rn 2 ae 3m3 ) , optimally from given data by maximizing this PDF. In practice, we usu- 
ally parametrize the bispectrum using a few parameters (e.g., /nl) 5 and estimate those 
parameters from the data by maximizing the PDF with respect to the parameters. 

In the limit that the contribution of the connected four-point function (trispectrum) to the 
PDF is negligible compared to those of the power spectrum and bispectrum, equation (26) 
contains all the information on non-Gaussian fluctuations characterized by the covari- 
ance matrix, = {a* timi ai 2m , 2 ) , and the angular bispectrum, (a £imi ai 2 m 2 ae 3 m 3 ) . This 

approach can be extended straightforwardly to the trispectrum if necessary. 

As the bispectrum has three angular wavenumbers, £\, £2 and £ 3 , it can form triangles with 
various shapes. Among all the shapes, the so-called “local-form bispectrum,” parametrized 
by a non-linear parameter /nl [71], carries a special significance, as detection of a large 
local-form bispectrum would rule out all inflation models based on a single energy compo- 
nent with a Bunch-Davies initial vacuum state and an attractor solution (see [72] for the 
latest discussion on this theorem). This triangle has the largest amplitude in the “squeezed 
configurations” in which one of the wavenumbers, say £ 3 , is much smaller than the other 
two, i.e., £3 <C £\ « £2 [73]. Detailed descriptions on what this bispectrum is and what the 
other shapes are, as well as on how to measure them can be found in [74]. 

Using the foreground-reduced WMAP nine-year temperature data at 61 and 94 GHz with 
the KQ75 mask, we find /nl = 37 ± 20 (68% CL), which is consistent with zero to within 
2<t; thus, the measurement agrees with the basic prediction of single-field inflation models 
with a Bunch-Davies initial vacuum state and an attractor solution. The Planck improves 
this limit greatly by finding /nl = 2.7 ± 5.8 (68% CL) [75]. 

One way to generate the local-form bispectrum is to write the primordial curvature per- 
turbation as 77(x) = 77 l(x) + f /nl77|(x). This form is called the “local form” because both 
sides are evaluated at the same spatial location, x. Here, TZl is a Gaussian random held, 
and the curvature perturbation is defined such that the linear Sachs-Wolfe effect gives 
ST /T = — TZl/5. Using this form and the fact that the variance of TZ is 2 x 10 -9 , we find that 
the 95% upper bound from Planck, /nl < 14, implies that the observed CMB is Gaussian to 
the precision of 0.04% or better. This is a remarkable degree of Gaussianity, which provides 
strong evidence that the observed CMB fluctuations originate from quantum fluctuations 
generated during single-held inflation. 


2 Babich [70] derived this formula for = CeSu'Smm 1 . 
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7. Implications for inflation 

Models of inflation [76-80] make specific, testable predictions. The simplest models based 
upon a single energy component (scalar field), which slowly rolls down on its potential 
and drives a sustained quasi-exponential expansion for at least 50 e-folds, predict that the 
observable universe is homogeneous and isotropic with flat geometry, and is filled with small 
fluctuations which are precisely adiabatic and nearly Gaussian (before fluctuations become 
non-linear). Both scalar and tensor fluctuations with various wavelengths are generated 
during inflation. The wavelengths of these fluctuations can exceed the horizon size at the 
decoupling epoch, and the amplitude of these fluctuations weakly depends on wavelengths. 

All of these predictions fit the WMAP data remarkably well: flatness is measured with 
0.4% precision (from WMAP combined with the Baryon Acoustic Oscillation and the local 
Hubble constant measurements); the adiabatic condition holds to better than 7% precision; 
a deviation from Gaussian fluctuations is restricted to be less than 0.2% (and 0.04% with the 
Planck 2013 data); and the presence of super- horizon fluctuations at decoupling is decisively 
detected in the TE cross power spectrum at i < 150. The WMAP data combined with the 
Baryon Acoustic Oscillation and the local Hubble constant measurements find convincing 
evidence for the scale dependence of the scalar initial power spectrum with 4.9cr significance, 
with the best-fit value in agreement with the first prediction made in [81]. 

While WMAP did not find signatures of tensor fluctuations, the upper bound on the 
tensor-to-scalar ratio inferred from the temperature data is consistent with many single- 
field inflation models. Figure 9 compares the limits on n s and r with a few representative 
single-field inflation models. 


8. Conclusion 

The nine years of observations of the WMAP satellite have taught us many things. The cur- 
rent universe is 13.77 billion years old, and consists of 4.6% atoms, 24% cold dark matter, 
and 71% dark energy [37, 38]. The nature of dark energy is consistent with that of a cosmo- 
logical constant. The spatial geometry of the universe is consistent with Euclidean geometry. 
The universe is filled with neutrinos, whose abundance is consistent with the standard model 
of particle physics. The mass of neutrinos is much less than 1 eV. 

The measured properties of primordial fluctuations such as adiabaticity, Gaussianity, and 
near scale invariance all point toward a remarkable scenario: the observed fluctuations 
originate from quantum fluctuations generated during inflation driven by a single energy 
component. WMAP offered a number of stringent tests of the simplest inflation scenarios: 
(1) flat universe, (2) adiabatic fluctuations, (3) super-horizon fluctuations, (4) nearly, but 
not exactly, scale-invariant initial power spectrum, and (5) Gaussian fluctuations. The sim- 
plest scenarios passed all of these tests. The Planck 2013 data have confirmed all of these 
findings with greater precision. 

Yet, neither the WMAP nor the Planck 2013 data detect the signature of primordial gravi- 
tational waves from inflation in CMB. Detecting and characterizing the B-mode polarization 
of the CMB is the next milestone in cosmology. While the BICEP/Keck Array collabora- 
tion claims to have found the B-mode polarization from inflationary gravitational waves at 
150 GHz, confirmation of the signal at other frequencies and with an independent experiment 
must be made before we claim a victory in observing all of the inflation predictions. 
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Fig. 9 Two-dimensional joint marginalized constraints (68% and 95% CL) on the primor- 
dial tilt, n s , and the tensor-to-scalar ratio, r. The red contours show the constraint from the 
WMAP nine-year data combined with the small-scale CMB temperature data ( ACT and 
S'PT), the Baryon Acoustic Oscillation data, and the local Hubble constant. The blue con- 
tours show the constraint from the Planck 15.5-month temperature data combined with the 
WMAP nine-year polarization data and the Baryon Acoustic Oscillation data. The symbols 
show the predictions from single-field inflation with monomial potentials, V((j)) oc cf) n [82], 
with n = 4 (black), 2 (white), and 1 (light grey), and with a R 2 term in the gravitational 
action (dark grey) [76]. 
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